TAO - 2012 - Annual activity report

TAO

TAO - 2012

Project-Team Tao

Members

Overall Objectives

Presentation
Context and overall goal of the project
Highlights of the Year

Scientific Foundations

Introduction
Optimal Decision Making under Uncertainty
Continuous Optimization
Designing criteria
Distributed systems

Application Domains

Energy Management
Air Traffic Control

Software

Metis
MoGo
CMA-ES: Covariance Matrix Adaptation Evolution Strategy
COmparing Continuous Optimizers
MultiBoost
Grid Observatory

New Results

Realistic step sizes for optimization algorithms
Noisy Optimization Bounds with Constant Noise Variance
Extensions of Upper Confidence Trees
Mixing myopic fast algorithms and asymptotically optimal algorithms
Adaptive Metropolis with Online Relabeling
Reinforcement learning for frugal cascade learning

Bilateral Contracts and Grants with Industry

Bilateral Contracts with Industry
Bilateral Grants with Industry

Partnerships and Cooperations

Regional Initiatives
National Initiatives
European Initiatives
International Initiatives
International Research Visitors

Dissemination

Scientific Animation
Invited talks
Teaching - Supervision - Juries
Popularization

Bibliography

Publications of the year
References in notes

Previous |

Home | Next next

Section: New Results

Extensions of Upper Confidence Trees

We developed extensions of Upper Confidence Trees to continuous or large domains (states and/or actions) and to domains with high expertise or strong structure[37] , [31] , [38] (incidentally realizing performances on MineSweeper); we recently submitted a proof of a variant of UCT with consistency proof in the continuous domains (both actions and random variables are allowed to be continuous). Another extension is to the difficult setting with no possibility to “undo” a decision or duplicate a state; see [63] . Yet another extension aims at multi-objective optimization [56] .

Previous |

Home | Next next